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ABSTRACT 


A strategy to evaluate an expert system is formulated. The 
strategy proposed is based on finding an equivalent classifier to an 
expert system and evaluate that classifier with respect to an opt im al 
classifier, a Bayes classifier. 

This paper shows that for the rules considered that an equivalent 
classifier exists. Also, a brief consideration of meta and meta-meta 
rules is included. Also, a taxonomy of exjDert systems is presented and 
an assertion irwrte that an equivalent classi f ier exists for each type of 
expert system in the taxonomy with associated sets of underlying 
assumptions. 
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1. INTRODUCTION 


It is the purpose of this paper to formulate a formal mathematical 
relationship between an expert system and a classifier. If indeed the 
relationship is unique and constructable, then one will be able to 
evaluate an expert system by comparing that system to a Bayesian clas- 
sifier which is by definition the best estimate. 

In order to formulate a general result one must develop a defini- 
tion of an expert system and a taxonomy of special types of expert 
systems. I have selected my glossary from a document prepared for 
Electric Bower Research Institute entitled, Approaches to the 
Verification and Validation of Expert Systems [1], 

The major purpose of this paper is to develop a theoretical 
foundation for evaluating expert systems. 

2. DESORPTION OF EXPERT SYSTEMS 

A number of definitions of expert systems are found in Artificial 
Intelligence literature, tut the most informative, yet concise one 
seems to be the following. First, they perform large, tediously 
complicated and sometimes difficult tasks at expert levels of per- 
formance. Second, they emphasize domain specific problem-solving 
strategies over the more general 'weak methods' of AI. Third, they 
employ rules of self knowledge to dynamically consider their own 
inference process and provide explanations or justifications for 
conclusions reached. 

An expert system usually consists of four main parts, as shewn in 
Figure 1. 

1. Knowledge Base(s) of structured domain facts and their 
relationships and heuristics (problem-solving rules) , 

2. Inference Engine (s) for controlling the application of facts, 
relationships, and heuristics in solving the problem (s) at hand, 
usually under the control of an overall Meta-Controller module, 

3. Problem Data Base of information about the problem being 
solved and the history of the solution process, and 

4. User Interface providing results/status displays, explana- 
tions, and interaction facilities for user inputs. 

Either AI or Conventional software techniques can be used for the 
storage management of the problem data base and the control of the use 
interface, but AI prog ra mming techniques are always used for repre- 
senting the knowledge base(s) and developing the inference engine (s) . 

In the sections that follow, the differences between expert 
systems and conventional software and the types of expert systems, with 
respect to the ease of verifying and validating them, will be dis- 
cussed. 

2.1 Differences Fran Conventional Software 

Ccnputer scientist claim that expert systems differ from conven- 
tional software programs both in the types of problems they solve and 
their internal structure. Like a human expert, the expert system has 
to accomodate information that is incomplete, erroneous, or mislead- 
ing. Yet, a choice of an action or decision among several alternatives 
must be made. They also claim that a conventional software program 
works correctly only when inputs are complete, of the proper syntax. 
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and the problem is anbiguous. These claims are not at all obvious to 
others in other discipline who view AI and AI tools as simply conputer 
language and an approach to helping solve important engineering 
problems. However, expert systems and their rule base does pose sane 
new specific problems in evaluation of software, The problems expert 
systems are designed to solve generally fall into the following 
categories: interpretation, prediction, diagnosis, design, planning, 

monitoring, debugging, repair, instruction and control. Notice, each 
of these categories contain a function of observing, recognizing and 
may include a call for action. 

Expert systems have many structural similarities with conventional 
software in the modules which perform conventional tasks such as input 
problem data processing, data management, and user interface display 
processing. However, the core of expert systems, including the 
knowledge bases and inference engines, can be different. Symbolic 
representation techniques are primarily used to represent knowledge 
versus conventional software's numeric or table— based techniques for 
representing information. These symbolic representations include 
information about relationships between data items and data item 
aggregations and information about heuristics (procedures and rules) 
for problem solving, which are not represented in conventional data 
bases. Expert systems are developed to solve problems which may not 
have easily formalized or algorithmic solutions, so the problem-solving 
apparatus (Inference engines) must use urxxawentional methods, in- 
cluding heuristic-guided search, symbolic inferencing, generation and 
test of solutions, and constraint-based reasoning. 

It is these differences from conventional software which make 
expert systems interesting and applicable, and yet also less easily 
tested, verified and validated. Because their solution methods are 
often potentially unbounded and not prescribed in a strai^itforward 
"recipe" format like conventional solution methods, they are therefore 
harder to "prove" correct. 

2.2 Expert System Types 

There are many ways of dividing expert systems into types, by 
application, by method of reasoning, etc. Examples of these groupings 
are found in Table 2-1 and Figure 2-1. In this discussion, expert 
systems will be divided into types relative to the complexity and 
difficulty of performing V&V on each type. The types are listed in 

Table 2-0. 

TABLE 2-0.- EXPERT SYSTEM TYPES 

NUMBER NAME 

1 Simple, based on Codified Knowledge 

2 Simple with Uncertainty Handling 

3 Simple, based on Elicited Knowledge 

4 Elicited with Uncertainty Handling 

5 Ocnplex 

6 Ocnplex with Uncertainty Handling 

The first type of expert system. Simple, is developed through the 
straightforward encoding of validated and verified decision tables 
and/or procedure trees. Its search space is small and examined with 
exhaustive search techniques or large and factorable and examined with 


21-4 




21-5 


Figure 1-1. Typical Expert System Components 






Table 2-1 


GENERIC CATEGORIES OF KNOWLEDGE 
ENGINEERING APPLICATIONS 


Category 

Problem Addressed 

Interpretation 

Inferring situation descriptions from sensor data 

Prediction 

Inferring likely consequences of given situations 

Diagnosis 

Inferring system malfunctions from observables 

Design 

Configuring objects under constraints 

Planning 

Designing actions 

Monitoring 

Comparing observations to plan vulnerabilities 

Debugging 

Prescribing remedies for malfunctions 

Repair 

Executing a plan to administer a prescribed remedy 

Instruction 

Dianosing, debugging, and repairing student behavior 

Control 

Interpreting, predicting, repairing, and monitoring 
system behaviors 
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Figure 2-1. Expert System Types Based on Reasoning Methods 
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enumerative search techniques, for which proofs exist of the best 
answer. A single line of reasoning is used and reasoning is monotonic, 
approaching a single, best answer. This type rou^ily corresponds to 
types 1,3, and 4 in Figure 2-1. 

The most important feature besides simplicity for this type is 
that the knowledge to be encoded into Idle expert system has already 
been written down in some form and tested for c o rrec t ness. If the 
codified decision tables or procedure trees must be augmented with 
nan-proved knowledge, then the expert system does not fall into this 
type category. So, V&V of this type of expert system involves merely 
proving that it correctly represents and uses the knowledge, not that 
the knowledge itself is correct. 

The second type of expert system. Simple with Uncertainty 
Handling, has the features of simplicity and proven knowledge in com- 
mon with the first, but also includes certainty factors, fuzzy logic, 
probabilities, or some other method of dealing with uncertain infor- 
mation. This type of expert system roughly corresponds to type 2 in 
Figure 2-1. There are two general types of uncertainty, uncertainty 
on the existence or value of knowledge items or their relationships 
and uncertainty on the rules, reflecting the expert's imxertainty of 
the applicability of the rule on the antecedent conditions or the 
appropriateness of the conclusions. Expert systems may incorporate 
either or both forms of uncertainty. There are various methods of 
combining certainty factors which may be used alone or in combination 
in an expert system. The decision of which combination function(s) 
to use is ccnplex and requires analysis and judgment on the part of 
the knowledge engineer and expert and effects the reliability of the 
system. 

Even if there exist pre-proven uncertainty values on knowledge 
items, or rules, the method of combining those factors most-likely is 
not pre-proven and tested. And, usually, the values themselves do 
not pre-exist and must be elicited from experts. An example is the 
Probability Risk Assessment (IRA) prior event probabilities. Even 
though these probabilities pre-exist and have been determined to be 
correct, to the best knowledge of experts, hew the probabilities can 
be combined and what probabilities to place on rules have not been 
predetermined and tested. Therefore, uncertainty handling compli- 
cates the V&V of simple expert systems by adding the need to V&V the 
certainty factors and their combination method(s) . 

The third type of expert system, Simple based on Elicited 
Knowledge, is probably the most common type in existence. The sim- 
plicity of the previous two types is still evident, probably because 
an expert system building tool or environment was used to develop the 
system. However, the knowledge (items and rules) , or sane portion of 
it, does not pre-exist in a tested form and must be elicited from 
usually one expert. Successive prototypes are developed and tested 
to verify that the knowledge required to solve the problem is actually 
being elicited and that it is encoded properly. 

The fourth type of expert system. Elicited with Uncertainty 
Handling, is a combination of the second and third. Its knowledge has 
been elicited from an expert and it includes certainty factors. An 
example of this type of expert system is the Mycin system in the field 
of m wriirai diagnosis developed by Stanford University. V&V of this 
type of expert system would include testing the correctness of the 
knowledge, the correctness of the knowledge implementation, the 
correctness of the uncertainty factors, and the correctness of the 
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uncertainty combination functions (s) . 

The fifth type of expert system, Complex, eliminates all or some 
of the simplicity assumptions of the previous four, as listed in the 
description of the first type of expert system. The search space (s) 
can be infinite or large and unfactorable, subproblems within the 
problem may interact in various ways, constraint-based reasoning may be 
used to limit search and multiple lines of reasoning may be pursued to 
independently produce candidate answers. This type of expert system 
roughly corresponds to types 5-11 in Figure 2-1. Usually the knew-, 
ledge Is elicited f rom more than one expert, so conflicting heuristics 
may arise. Also, visually expert system building tools are not used to 
implement these systems because the tools do not allow the complexities 
to be built in. 

Examples of the fifth type of expert system include autonomous 
vehicle control and battle management applications, such as those in 
the Strategic Computing Program funded by the Defense Advanced Research 
Projects Agency (DARPA, 1986) . Expert systems of this type are just 
now beginning to work in a primitive fora, and V&V of these systems is 
still a research issue. It is very unlikely that this type of expert 
system will be developed for non-military application ary time in the 
near future, so V&V of this type of expert system will not be discus- 
sed. 

The sixth type of expert system. Complex with Uncertainty 
Handling, arHa the complicating factor of uncertainty handling to the 
fifth, making V&V even more impossible and even more of a research 
issue. This type also will not be discussed. 

3. THE MAIN RESULT 

Consider a simple expert system represented by the following rules 
involving two conditions C, and C_ which can occur or fail to occur and 
four actions Ag, A^, Aj, and A^. 


RO 

If CjAC^, 

then Aq, 

Rl 

If ^ , 

then A^ 

R2 

If (^AOj, 

then A^ and 

R3 

If ^ACj, 

then A-j, 


which can also be represented in vector notation 


RO 

If 

(0, 

0), 

then 

(0) 

Rl 

If 

(0, 

1), 

then 

(1) 

R2 

If 

(1, 

0), 

then 

(2) 

R3 

If 

(If 

1), 

then 

(3) 


Note that the set of rules { RO, Rl, R(2) , R(3) } can be thought of and 
modeled by a function R 

R: (r^ r 2 ) { 0,1,2, 3 ) 

where r- and r 2 € { 0,1 ) 
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Suppose further that 


Pq-pt [C^Cj] , 

P x = Pr [^A^] »■ 

p 3 - Pr [^AC^] , and 

P4 = Pr [C^A^] 1 

are the a priori probabilities that the system are in the respective 
states, 

S 0 ” ' 

S 1 = c i aG 2 ' 

5 3 = Cj^a Cj , and 

5 4 = c 1 aC 2 ' 

Also let data be taken to make a decision as to which state the 
system belongs. The data can be vector valued, dependent or indepen- 
dent, known exactly (deterministic) or observable (random) . In order 
to determine if C x occurs, we follow the usual logic of statistical 

classification theory [2]. Here if 

X = (X^ X^ ..., Xp) T and Y * (Y r Y 2 , ..., Y q ) T 

denotes the observations, then the recognition rules are 

KR 1 If X # 1^, then - 0. 

If X ^ 1^, then = 1. 

RR 2 If Y ^ Rj, then Cj = 0. 

If Y $ Rj, then = 1. 

We \ose the notation Cj to denote that we estimate the value of Cj 

and do not know that value exactly. Ihe classification regions Rl and 
R 2 are selected by the expert or opti m a l ly as Bayes regions. 

Note that the set of rules when described as above is equivalent 
to a classical statistical classification problem. We knew the solu- 
tion to problem; that is, the best classifier is a Bayes classi- 
fier. A Bayes classifier is one in which the regions and Rj are 

selected judiciously to assure that the expected costs of misdassi- 
ficaticn are minimized. 

Probabilities that an expert system ocomits errors can now be 
rmpri-Pri when the probability density functions f(x) and f(y) of X and 
Y are known. That is, 

Pr [C x = 0 = 1] = ^ f x (x)dx 

*1 

Pr [C 2 = 0 l C 2 = 1] = J f 1 (y)dy 

*2 
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and the probability of not making an error is given 

Pr [C x = 0 ] = 0] = I f Q (x)dx 

*1 

Pr [ C 2 = 0 ( C 2 = 0] = J f Q (y)dx 

*2 

Note that if X and Y are independent then various combined prob- 
abilities can be computed easily. If X and Y are not independent then 
the joint probability density function is required. T 

One can stack the observations into a new random vector Z = (x,y) 
and establish Z as the data vector. This form allows one to consider 
the effect of the covariance structure of X and Y. 

Suppose one introduce some "beliefs” concerning the rules. Let us 
model them in the following form. 

RB 0 Pr[R0 is true] = qQ 

RB 1 Pr[Rl is true] = q-j^ 

RB 2 Pr[R2 is true] = q 2 

EB 3 Pr[R3 is true] = q 3 

This in many applications can affect the action, that is, the 
actions are modified to 

RO If C^A C 2 , then PtCAq] = 

and Pr[A^] = l-q^ 

R1 If C^A C^, then Pr^] = q 3 

and Pr[A^] = l-q r 

R2 If C^A C 2 , then PrC^] = q 2 

and Pr[A^] = l-q 2 , and 

R3 If then PIAj] = q 3 

and Pr[A^] = l-q 3 

These rule can be modelled as stochastic actions which are applied 
with a specified probability. This formulation introd uce s the concept 
of stoc hastic actions. This in turn generates a requirement for con- 
flict resolution rules for handling uncertainties of this type. To 
exanple consider the following two types of conflict resolution rules. 

CRRl: If q > 1-q, then select action A, if q $ 1-q, select action iJ . 

CRR2: If the ejected cost of Aq is greater than expected cost of A^, 

select A£; if not Aq. 
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CKO: Generate a uniform random nunfcer N on the interval [ 0 , 1 ], if 0 < 

N£q then select action Agj if q<N51, then select A^. 

The first two rules are deterministic rules while CKO is a sto- 
chastic rule. Clearly, these rules can be extended to more than two 
actions per rule. 

What we have done is given a precise formulation of an expert 
system which in turn gives a mathematical model for evaluating the 
expert system as ccn^ared with on optimal Bayesian classifier. 

Let us new consider the problem of evaluation. Let 

= {X; X<X<X) 

1*2 = {Y; YiY 5Y) 

then we see that directly 

Y 



Ficmre 3.1. The (X,Y) plane partitioned into action regions based 
on an. Expert System if 

X £ 91x1 Y ^ ** 2 * then Aq 

X £ and Y ^ Rj, then A^ 

X ^ R^ and Y % Rj, then A 2 

X^andY*^, then *3 

«here ^ and % are the regions defined by the expert. 

However, a Bayesian classifier would give different regions R^and 
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Figure 3.2. The (X,Y) plane partition into action regions based 
on Bayesian classifier. 

Now consider the following meta rule and a meta-meta rule: 

MR1: If after K trials 


Ci^Cj occurs Kq ti m es 
CfAC^ occurs ti m es 
occurs Kj times 
C 1^2 oocurs K3 times 

then replace ( a's are preselected known parameters) , 

P 0 with a Q P 0 + (l-a Q ) VK 
P 1 with a 1 P 1 + (l-a 1 ) Kj/K 
P 2 with ajPj + (l-aj) Kj/K 
P 3 with a^ + (l-eLj) Kj/K 
and carpute new regions and Rj. 


MMR1: Let K(j) * K + j 50. 

The argument appears reasonable that an expert system can be 
modelled mathematically by finding an equivalent classifier and the 
expert system can then be evaluated by evaluating that classifier. 


4. CONCLUDING REMARKS 

The types of expert systems listed in Table 2.2 and discussed in 
Section 2 of this paper may new be defined precisely by defining an 
associated classifier when different amount and/or kind of knowledge 
differs. 

This allows evaluation of expert systems to be done in a direct 
way compatible with the theory of classification. 
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